27 research outputs found

ELVIS: Entertainment-led video summaries

Author: Arthur G. Money
Babaguchi N.
Cacioppo J. T.
Damnjanovic U.
Furini M.
Greenwald M. K.
Harry Agius
Jaimes A.
Kim J.
Leonhardt S.
Millet C.
Money A. G.
Nasoz F.
Rikkard N. S.
Sebe N.
Shipman S.
Takahashi Y.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/08/2010

© ACM, 2010. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Transactions on Multimedia Computing, Communications, and Applications, 6(3): Article no. 17 (2010) http://doi.acm.org/10.1145/1823746.1823751

Video summaries present the user with a condensed and succinct representation of the content of a video stream. Usually this is achieved by attaching degrees of importance to low-level image, audio and text features. However, video content elicits strong and measurable physiological responses in the user, which are potentially rich indicators of what video content is memorable to or emotionally engaging for an individual user. This article proposes a technique that exploits such physiological responses to a given video stream by a given user to produce Entertainment-Led VIdeo Summaries (ELVIS). ELVIS is made up of five analysis phases which correspond to the analyses of five physiological response measures: electro-dermal response (EDR), heart rate (HR), blood volume pulse (BVP), respiration rate (RR), and respiration amplitude (RA). Through these analyses, the temporal locations of the most entertaining video subsegments, as they occur within the video stream as a whole, are automatically identified. The effectiveness of the ELVIS technique is verified through a statistical analysis of data collected during a set of user trials. Our results show that ELVIS is more consistent than RANDOM, EDR, HR, BVP, RR and RA selections in identifying the most entertaining video subsegments for content in the comedy, horror/comedy, and horror genres. Subjective user reports also reveal that ELVIS video summaries are comparatively easy to understand, enjoyable, and informative.

Crossref

Brunel University Research Archive

Video Summarization Using Deep Semantic Features

Author: D Potapov
DG Lowe
G Evangelopoulos
M Gygli
N Babaguchi
N Ejaz
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/09/2016

Computer Vision - ACCV 2016: 13th Asian Conference on Computer Vision, Nov 20-24, 2016, Taipei, Taiwan

This paper presents a video summarization technique for an Internet video to provide a quick way to overview its content. This is a challenging problem because finding important or informative parts of the original video requires understanding its content. Furthermore, the content of Internet videos is very diverse, ranging from home videos to documentaries, which makes video summarization much harder, as prior knowledge is almost never available. To tackle this problem, we propose to use deep video features that can encode various levels of content semantics, including objects, actions, and scenes, improving the efficiency of standard video summarization techniques. For this, we design a deep neural network that maps videos as well as descriptions to a common semantic space and jointly train it with associated pairs of videos and descriptions. To generate a video summary, we extract the deep features from each segment of the original video and apply a clustering-based summarization technique to them. We evaluate our video summaries using the SumMe dataset as well as baseline approaches. The results demonstrate the advantages of incorporating our deep semantic features in a video summarization technique.

arXiv.org e-Print Archive

NAIST Academic Repository

Crossref

A Video Summarization Method for Basketball Game

Author: A. Ekin
N. Babaguchi
N. Otsu
R. Leonardi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005

Crossref

Motion-Based Semantic Event Detection for Video Content Description in MPEG-7

Author: J. L. Lian
N. Babaguchi
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Associating Cooking Video Segments with Preparation Steps

Author: H.D. Wactlar
N. Babaguchi
Y. Watanabe
Publication venue: 'Springer Science and Business Media LLC'

Crossref

A Hierarchical Semantics-Matching Approach for Sports Video Annotation

Author: A. Ekin
C.L. Huang
N. Babaguchi
S.B. Needleman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

Crossref

Video summarization using deep semantic features

Author: D Potapov
DG Lowe
G Evangelopoulos
M Gygli
N Babaguchi
N Ejaz
Publication venue: Springer Nature
Publication date: 27/09/2016

Abstract: This paper presents a video summarization technique for an Internet video to provide a quick way to overview its content. This is a challenging problem because finding important or informative parts of the original video requires understanding its content. Furthermore, the content of Internet videos is very diverse, ranging from home videos to documentaries, which makes video summarization much harder, as prior knowledge is almost never available. To tackle this problem, we propose to use deep video features that can encode various levels of content semantics, including objects, actions, and scenes, improving the efficiency of standard video summarization techniques. For this, we design a deep neural network that maps videos as well as descriptions to a common semantic space and jointly train it with associated pairs of videos and descriptions. To generate a video summary, we extract the deep features from each segment of the original video and apply a clustering-based summarization technique to them. We evaluate our video summaries using the SumMe dataset as well as baseline approaches. The results demonstrate the advantages of incorporating our deep semantic features in a video summarization technique.

arXiv.org e-Print Archive

NAIST Academic Repository

Crossref

University of Oulu Repository - Jultika

Multimedia analysis for ecological data

Author: Babaguchi N.
Mezaris V
Ossenbruggen Jacco
Spampinato Concetto
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/11/2012

Automatic Score Scene Detection for Baseball Video

Author: E. Sahouria
H.B. Nguyen
N. Babaguchi
R. Brunelli
T. Miyazaki
Y. Gong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008

Crossref

A scalable and extensible segment-event-object-based sports video retrieval system

Author: Adrien Joly
Assfalg J.
Babaguchi N.
Chairsorn L.
Dian Tjondronegoro
Ekin A.
Han M.
Ngai C. H.
Pereira F.
Sato T.
Wu C.
Xie L.
Xu P.
Yi-Ping Phoebe Chen
Zeinik-Manor L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2008

Sport video data is growing rapidly as a result of the maturing digital technologies that support digital video capture, faster data processing, and large storage. However, (1) semi-automatic content extraction and annotation, (2) a scalable indexing model, and (3) effective retrieval and browsing still pose the most challenging problems for maximizing the usage of large video databases. This article will present the findings from a comprehensive work that proposes a scalable and extensible sports video retrieval system with two major contributions in the area of sports video indexing and retrieval. The first contribution is a new sports video indexing model that utilizes a semi-schema-based indexing scheme on top of an Object-Relationship approach. This indexing model is scalable and extensible as it enables gradual index construction which is supported by ongoing development of future content extraction algorithms. The second contribution is a set of novel queries which are based on XQuery to generate dynamic and user-oriented summaries and event structures. The proposed sports video retrieval system has been fully implemented and populated with soccer, tennis, swimming, and diving video. The system has been evaluated with 20 users to demonstrate and confirm its feasibility and benefits. The experimental sports genres were specifically selected to represent the four main categories of sports domain: period-, set-point-, time (race)-, and performance-based sports. Thus, the proposed system should be generic and robust for all types of sports.

Deakin Research Online

Crossref

Queensland University of Technology ePrints Archive